AD-A225  274 


Graphical  Inference  in  Qualitative  Probabilistic  Networks 


Michael  P.  Wellman 

Artificial  Intelligence  Technology  Office 


June  1990 


Final  Report  for  Period  March  1989  -  June  1990 


Approved  for  public  release;  distribution  is  unlimited 


t 


i 


Technology  Exploitation  Directorate 
Wright  Research  and  Development  Center 
Air  Force  Systems  Command 

Wright.-Patterson  Air  Force  Base,  Ohio  45433-6523 


*  t  a 


n  .  . 

£  LeCTE'i 

AUG  10  1900 


90  08 


NOTICE 


WHEN  GOVERNMENT  DRAWINGS,  SPECIFICATIONS,  OR  OTHER  DATA  ARE  USED  FOR  ANY 
PURPOSE  OTHER  THAN  IN  CONNECTION  WITH  A  DEFINITELY  GOVERNMENT-RELATED 
PROCUREMENT,  THE  UNITED  STATES  GOVERNMENT  INCURS  NO  RESPONSIBILITY  OR  ANY 
OBLIGATION  WHATSOEVER.  THE  FACT  THAT  THE  GOVERNMENT  MAY  HAVE  FORMULATED  OR  IN 
ANY  WAY  SUPPLIED  THE  SAID  DRAWINGS,  SPECIFICATIONS,  OR  OTHER  DATA,  IS  NOT  TO 
BE  REGARDED  BY  IMPLICATION,  OR  OTHERWISE  IN  ANY  MANNER  CONSTRUED,  AS  LICENSING 
THE  HOLDER,  OR  ANY  OTHER  PERSON  OR  CORPORATION;  OR  AS  CONVEYING  ANY  RIGHTS  OR 
PERMISSION  TO  MANUFACTURE,  USE,  OR  SELL  ANY  PATENTED  INVENTION  THAT  MAY  IN  ANY 
WAY  BE  RELATED  THERETO. 

THIS  REPORT  HAS  BEEN  REVIEWED  BY  THE  OFFICE  OF  PUBLIC  AFFAIRS  (ASD/PA) 
AND  IS  RELEASABLE  TO  THE  NATIONAL  TECHNICAL  INFORMATION  SERVICE  (NTIS).  AT 
NTIS  IT  WILL  BE  AVAILABLE  TO  THE  GENERAL  PUBLIC  INCLUDING  FOREIGN  NATIONS. 

THIS  TECHNICAL  REPORT  HAS  BEEN  REVIEWED  AND  IS  APPROVED  FOR  PUBLICATION. 


Michael  P.  Wellman,  Capt,  USAF 
Project  Engineer 

Artificial  Intelligence  Tech.  Office 


FOR  THE  COMMANDER 


William  R.  Baker 

Chief,  Artificial  Intelligence 

Technology  Office 


IF  YOUR  ADDRESS  HAS  CHANGED,  IF  YOU  WISH  TO  BE  REMOVED  FROM  OUR  MAILING 
LIST,  OR  IF  THE  ADDRESSEE  IS  NO  LONGER  EMPLOYED  BY  YOUR  ORGANIZATION  PLEASE 
NOTIFY  WRnr/Txi  WRIGHT-PATTERSON  AFB,  OH  45433-  ^23  TO  HELP  MAINTAIN 

A  CURRENT  MAILING  LIST. 

COPIES  OF  THIS  REPORT  SHOULD  NOT  BE  RETURNED  UNLESS  RETURN  IS  REQUIRED  BY 
SECURITY  CONSIDERATIONS,  CONTRACTUAL  OBLIGATIONS,  OR  NOTICE  ON  A  SPECIFIC 
DOCUMENT. 


UNCLASSIFIED 


SECURITY  CLASSIFICATION  Of  THIS  PAGE 


la  REPORT  SECURITY  CLASSIFICATION 

Unclassified 


2a  SECURITY  CLASSIFICATION  AUTHORITY 


2b  DECLASSIFICATION /DOWNGRADING  SCHEDULE 


4  PERFORMING  ORGANIZATION  REPORT  NUMBER(S) 

WRDC-TR-90-6002 


REPORT  DOCUMENTATION  PAGE 


lb  RESTRICTIVE  MARKINGS 


Form  Approved 
OMB  No  0704  0188 


3  DISTRIBUTION /AVAILABILITY  OF  REPORT 

Approved  for  Public  Release;  distribution  is 

unlimited. 


5  MONITORING  ORGANIZATION  REPORT  NUMBER(S) 


fid  NAME  OF  PERFORMING  ORGANIZATION 

Al  Technology  Office 

6b  OFFICE  SYMBOL 
(If  applicable) 
WRDC/TXI 

7a  NAME  OF  MONITORING  ORGANIZATION 

6c  ADDRESS  {City,  State,  and  7  IP  Code) 

WPAFB  OH  45433-6523 

7b  ADDRESS  (City,  State,  and  ZIP  Code) 

8d  NAME  OF  FUNDING /SPONSORING 
ORGANIZATION 

8b  OFFICE  SYMBOL 
(If  applicable) 

9  PROCUREMENT  INSTRUMENT  IDENTIFICATION  NUMBER 

1  8c  ADDRESS  (Ofy,  State,  and  ZIP  Code)  1 

10  SOURCE  OF  FUNDING  NUMBERS 

PROGRAM 
ELEMENT  NO 

6.2201F 


TASK 

WORK  UNIT 

NO 

ACCESSION  NO 

05 

54 

11  TITLE  (Include  Security  Classification) 

Graphical  Inference  in  Qualitative  Probabilistic  Networks 

12  PERSONAL  AUTHOR(S) 
Wellman,  Michael  P. 

Ha  TYPE  Ol  REPORT 

13b  TIME  COVERED 

14  DATE  OF  REPORT  (Year.  Month,  Day) 

15  PAGE  COUNT 

Final 

FROM  1  Mar  89  TO  Jun  90 

1990  June 

35 

COSATI  CODES 


SUB-GROUP 


18  SUBJECT  TERMS  < Continue  on  reverse  if  necessary  and  identify  by  block  number) 

'  I  .  '1/ 


19  ABSTRACT  (Con^/nue  on  reverse  if  necessary  and  identify  by  block  number) 

Qualitative  probabilistic  networks  (QPNs)  are  abstractions  of  influence  diagrams  that 
encode  constraints  on  the  probabilistic  relation  among  variables  rather  than  precise 
numeric  distributions.  Qualitative  relations  ex  press  monotonicitv  constraints  on  direct 
probabilistic  relations  between  variables,  or  on  interactions  among  the  direct  relations. 
Like  their  numeric  counterpart,  QPNs  facilitate  graphical  inference:  methods  for  de¬ 
riving  qualitative  relations  of  interest  via  graphical  transformations  of  the  network 
model.  However,  query  processing  in  QPNs  exhibits  computational  properties  quite 
different  from  basic  influence  diagrams.  In  particular,  the  potential  for  information 
loss  due  to  the  incomplete  specification  of  probabilities  poses  the  new  challenge  of  min¬ 
imizing  ambiguity.  Analysis  of  the  properties  of  QPN  transformations  reveals  several 
characteristics  of  admissible  graphical  inference  procedures.  f\t'<  1  ' 


|  Z0  DISTRIRUT ION /  AVAILABILITY  OF  ABSTRACT 

QlJNCI  ASSIEIE  D/UNLIMITED  □  SAME  AS  RPT  □  DTIC  USERS 

21  ABSTRACT  SECURITY  CLASSIFICATION 

22a  NAME  Ol  RESPONSIBLE  INDIVIDUAL 

Wellman,  Michael  P. 

22b  TELEPHONE  (Include  Area  Code) 

(513)255-5800 

22c  OTFICE  SYMBOL 

WRDC/TXI 

00  Form  1473,  JUN  86 


Previous  editions  are  obsolete 


_SL  CLIRIT  Y  CLASSIFICATION  Ol  T  M  is  PAUL 

UNCLASSIFIED 


1  Q  UA  LIT  AT IV E  PROBABILISTIC  NETWORKS 

1  Qualitative  Probabilistic  Networks 

The  modeling  advantages  of  influence  diagrams  are  largely  attributable  to  the  graphical 
nature  of  the  representation,  which  highlights  the  important  relationships  among  the  domain 
variables.  As  in  the  related  belief- net  work  representation  [5],  the  structure  of  an  influence 
diagram  reveals  the  dependencies  (and  implicitly,  the  independencies)  operating  among  the 
variables  of  interest.  Furthermore,  transformation-based  algorithms  for  evaluating  influence 
diagrams-  -such  as  Shachter’s  [8] — preserve  the  model’s  structural  properties  at  each  stage 
of  the  computation.  Each  transformation  step  corresponds  to  an  intuitive  operation  on  the 
variables  (“averaging  out”  or  “belief  revision”).  Such  graphical  inference  is  attractive  because 
the  operators  may  be  arranged  flexibly  to  answer  a  variety  of  queries  [9]. 

These  desirable  properties  of  influence  diagram  models  emerge  from  the  representation’s 
structural  features  alone  and  do  not  depend  on  the  precise  specification  of  conditional  prob¬ 
abilities.  Given  this  observation,  it  is  natural  to  investigate  other  decision  model  represen¬ 
tations  that  share  the  structural  features  of  influence  diagrams  but  support  a  different  class 
of  inferences  about  the  relationships  among  decisions  and  events.  In  particular,  we  are  in¬ 
terested  in  finding  representations  that  are  more  robust  than  numerical  decision  models,  yet 
still  provide  decision-theoretic  justification  for  some  choices  among  competing  plans. 

Qualitative  probabilistic  networks  (QPNs)  [12]  are  influence  diagrams  with  the  numeric 
conditional  probability  tables  replaced  by  qualitative  probabilistic  relations  among  the  vari¬ 
ables.  Those  qualitative  relations  can  be  viewed  as  constraints  on  the  conditional  probability 
tables,  expressing  inequalities  that  must  be  satisfied  by  the  various  elements.1 

The  node  types  and  topological  dependence  properties  of  QPNs  and  influence  diagrams 
are  identical.  Because  qualitative  relations  are  weaker  than  numeric  conditional  probability 
tables,  QPNs  are  abstractions  of  influence  diagrams.  While  an  influence  diagram  represents 
a  particular  joint  distribution  among  its  random  variables,  a  QPN  represents  a  family  of 
such  distributions  satisfying  its  constraints.  Without  a  unique  joint  distribution,  of  course, 
it  is  not  generally  possible  to  determine  the  optimal  decision.  Rather,  the  aim  of  inference 
in  QPNs  is  to  derive  constraints  on  the  form  of  the  optimal  decision  policy. 

2  Qualitative  Relations 

The  constraints  on  probabilistic  relations  expressible  in  QPNs  are  not  arbitrary.  In  fact, 
QPNs  permit  only  very  regular  patterns  of  inequalities  to  be  asserted  about  conditional 
probabilities.  All  qualitative  relations  take  the  form  of  conditions  on  the  comparative  prob¬ 
ability  distribution  for  a  variable  given  various  values  for  its  predecessors  in  the  network, 
bet  Fr  denote  the  cumulative  probability  distribution  for  rasa  function  of  its  predecessors. 

'The  term  “table”  here  is  somewhat  misleading  because  QPN  variables  need  not  be  discrete. 
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2  QUALITATIVE  RELATIONS 


This  distribution  is  typically  represented  in  an  influence  diagram  by  a  conditional  proba¬ 
bility  table.  In  QPNs,  qualitative  relations  constrain  the  value  of  Fc  without  specifying  it 
precisely.  Relations  of  one  type,  called  qualitative  influences ,  restrict  the  relative  values  of 
/'V  upon  variation  of  one  of  c’s  predecessors.  Qualitative  synergies  constrain  the  behavior  of 
this  distribution  upon  c  hanges  of  two  predecessors  at  once.  Roth  types  of  relation  can  be 
interpreted  as  probabilistic  versions  of  monotonicity  conditions  on  the  partial  derivatives, 
first-  and  second-order,  respectively. 

Specifically,  a  qualitative  influence  from  a  to  c  with  sign  +  (  — )  means  that  the  probability 
distribution  for  c  given  a  is  nondecreasing  (nonincreasing)  in  a — all  else  equal — in  the  sense 
of  first-order  stochastic  dominance.  The  notation  Ss(a,  c),  <5  6  {  +  ,  — ,  0,  ?},  is  used  to  denote 
such  an  influence. 

Definition  1  (qualitative  influences)  Ss(a,c)  holds  in  a  qualitative  probabilistic  network 
G  if  and  only  if  (iff),  for  all  values  Cq  of  c,  x  of  c’s  predecessors  in  G  other  than  a,  and 
a i  >  a2  of  a: 

Ec(co\a\x)  Rs  Fc(co\a2x), 

where  R6  is  <>  or  —  as  6  is  -f,  — ,  or  0,  and  R ?  is  the  complete  relation  (thus  57  always 
holds ) . 

Such  a  complex  specification  of  a  rather  straightforward  concept  is  necessary  for  complete¬ 
ness.  The  condition  that  the  first  cumulative  distribution  be  no  greater  than  the  second  for 
all  values  of  c  is  a  requirement  of  first-order  stochastic  dominance.  (Lower  values  for  the 
cumulative  distribution  correspond  to  higher  probabilities  for  the  larger  values.)  Quantifi¬ 
cation  over  all  values  for  c’s  predecessors  (represented  by  the  variable  x,  typically  a  vector) 
realizes  the  “all  else  equal”  part  of  the  definition.  Fixing  all  variables  but  one  corresponds 
to  the  usual  interpretation  of  partial  derivatives. 

The  qualitative  synergy  relation  is  intended  to  capture  the  intuition  that  an  increase  in 
one  variable  has  greater  effect  at  higher  levels  of  the  other.  In  other  words,  the  combined 
effect  of  increasing  the  variables  is  greater  than  taking  the  two  effects  independently.  One 
way  to  define  this  probabilistically  is  demonstrated  by  the  following  qualitative  synergy  (Vs) 
condition. 

Definition  2  (qualitative  synergies)  V^({a,6},c)  holds  in  a  QPN  G  iff  for  all  values  c0 
of  (  fiance  node  c,  x  of  c  ’s  predecessors  in  G  other  than  a  and  b,  a |  >  a2  of  a,  and  b j  >  b2  of 
h: 

[F’c(co|ai6ix)  -  Fc(c0|a261x)]  Rs  [Fc(q,|a162x)  -  F-(co|a262x)] . 

Qualitative  synergy  on  value,  Vs ( {a,  b) ,  v)  holds  iff  the  utility  function  u  satisfies  the  follow¬ 
ing  (with  the  same  conditions  on  a,  b,  and  x): 

[u(a2,  /»i,  x)  —  u(cq ,  6, ,  x)]  Rb  |u(a2,  fcj,  x)  -  u(a,,  63,  x)] .  (1) 
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3  EXAMPLE:  A  GENERIC  DECISION  MODEL 


A  QPN  is  represented  by  a  directed  acyclic  graph,  with  nodes  representing  variables  and 
signed  edges  and  hyper-edges  representing  qualitative  relations.  A  link  from  a  to  h  with 
sign  />  denotes  an  assertion  that  Hy  convention,  ,S'°  assertions  are  implicit  in  the 

absence  of  a  link.2  It  follows  from  the  probabilistic  definit  ions  that  any  variable  without  an 
influence  link  to  some  target  is  also  non-synergistic  (T°)  with  any  other  variable  on  that 
target.  However,  the  absence  of  a  synergy  hyper-edge  for  variables  with  nonzero  influences 
to  a  target  does  not  imply  anything  about  their  interaction  or  non-interaction.  Lacking  an 
explicit  synergy,  we  adopt  the  conservative  default,  Y7 . 

The  probabilistic  definitions  of  the  qualitative  relations  were  chosen  for  two  primary 
properties.  First,  the  relations  are  preserved  by  certain  graphical  inference  operations.  These 
are  discussed  in  some  depth  in  Section  4  below.  Second,  qualitative  relations  involving 
decision  variables  and  the  value  node  constrain  the  form  of  the  optimal  decision  policy.  The 
example  of  Section  3  demonstrates  these  decision-theoretic  implications  of  QPNs.  See  [12] 
for  further  motivation  and  discussion  of  the  mathematical  properties  of  qualitative  influences 
and  synergies. 

3  Example:  A  Generic  Decision  Model 

A  significant  advantage  of  the  qualitative  formalism  is  that  the  relations  are  valid  for  a  wide 
range  of  contexts  and  interpretations  of  the  variables.  In  support  of  this  assertion,  Figure  1 
presents  the  QPN  representation  of  a  generic  decision  problem. 

The  model  of  Figure  1  includes  two  decision  variables,  representing  the  choices  of  obser¬ 
vations  and  actions.  These  variables  may  be  propositional  (whether  to  perform  the  action), 
or  may  admit  a  range  of  values  on  an  ordered  scale.  The  “act”  has  both  costs  and  benefits, 
each  positively  related  to  the  degree  to  which  the  act  is  performed.  For  example,  both  the 
therapeutic  results  (benefits)  and  undesirable  side  effects  (costs)  would  increase  in  the  sense 
of  S+  with  the  aggressiveness  of  a  medical  treatment  (act).  By  definition,  costs  negatively 
influence  the  value  node  whereas  the  influence  of  benefits  is  positive. 

There  is  also  an  unknown  “state  of  nature,”  9 ,  which  affects  the  valuation  of  costs  and 
benefits.  Although  it  makes  no  commitment  about  the  direct  effect  of  0  on  value  (S  (9,v)), 
the  generic  decision  model  requires  that  the  state  of  nature  be  positively  synergistic  with 
both  costs  and  benefits.  The  synergy  with  benefits  implies  that  the  beneficial  effects  of  the 


action  are  enhanced  for  higher  values  of  0.  For  example,  if  0  represents  underlying  disease 
severity  arid  the  bentfits  of  therapy  reflect  degree  of  cure,  then  the  synergy  captures  the  r 
idea  that  a  cure  has  greater  value  to  patients  whose  diseases  are  more  severe.  The  positive 
synergy  with  costs  rules  out  the  possibility  that  9  exacerbates  the  negative  effects  of  the 

‘See  Pearl  [5]  for  a  detailed  treatment  of  independence  implicit  in  the  topology  of  a  directed  probabilist  ic 
network. 
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3  EXAMPLE:  A  GENERIC  DECISION  MODEL 


Figure  1.  The  generic  decision  model. 

actions.  In  the  therapy  example,  this  implies  that  disease  severity  is  at  worst  neutral  with 
respect  to  the  side  effects  of  the  treatment. 

Although  the  state  0  is  not  directly  observable  by  the  decision  maker,  there  is  another 
variable,  info ,  which  is  related  to  9  by  a  positive  influence.  The  dashed  line  from  the  decision 
variable  observe  indicates  that  if  the  observation  is  performed,  the  value  of  info  is  accessible 
to  the  decision  maker  for  use  in  choosing  a  value  for  the  action.  The  act  of  observing  has 
costs,  but  no  benefits  other  than  the  value  of  information  revealed  in  this  way.3 

The  generic  decision  model  is  clearly  inadequate  to  justify  a  particular  choice  for  the 
decision  variable  act.  However,  the  model  is  strong  enough  to  provide  useful  constraint 
on  the  optimal  action  policy  as  a  function  of  the  information  available.  Using  the  graphical 
inference  operations  described  in  Section  4,  the  query  algorithm  transforms  the  original  QPN 
into  the  reduced  model  of  Figure  2. 

As  expected,  the  model  fails  to  establish  a  nondegenerate  relationship  between  act  and 
value.  The  useful  result  of  Figure  2  is  the  positive  synergy  Y+({info,  act},  v).  Positive 
synergy  imposes  a  constraint  on  the  utility  function  (1)  that  is  sufficient  to  establish  the 
monotone  decision  property  [7]  on  act  and  info.  This  property  dictates  that  the  optimal 

3Tlic  dashed  line  is  an  auxiliary  notational  element  not  formally  part  of  the  qualitative  probabilistic 
network.  This  convention  runs  counter  to  the  usual  approach  in  influence  diagrams,  which  include  informa¬ 
tional  links  as  part  of  the  model.  In  an  influence  diagram,  performing  the  observation  would  actually  change 
the  state  of  info.  Since  QPNs  were  designed  to  be  used  within  the  context  of  a  general-purpose  planning 
program  [11],  I  chose  to  maintain  a  uniform  interpretation  for  observable  variables  and  relegate  the  task  of 
maintaining  observability  constraints  to  the  planner. 


EXAMPLE:  A  GENERIC  DECISION  MODEL 


Figure  2.  The  generic  decision  model  reduced  to  three  variables. 

value  of  act  is  a  nondecreasing  function  of  info.  Thus,  if  the  observation  is  performed 
and  the  information  is  available,  the  decision  maker  should  raise  the  level  of  act  when  the 
information  indicates  the  expected  benefits  are  greater  (and  costs  are  no  worse).  If  the 
decision  maker’s  policy  is  to  choose  act  =  act\  upon  observing  info  —  info1,  then  consistency 
requires  that  it  choose  a  value  for  act  no  greater  than  act\  when  it  observes  info2  <  infol.  If 
act  is  a  propositional  variable,  the  synergy  result  implies  that  the  optimal  policy  is  to  take 
the  action  iff  info  exceeds  some  threshold  value. 

Although  the  details  of  any  particular  decision  situation  will  provide  a  stronger  basis  for 
choosing  a  decision  policy,  the  generic  analysis  above  serves  to  bound  the  strategy  space  at 
a  very  high  level  of  abstraction.  Such  bounds  are  useful  for  restricting  the  search  for  optimal 
policies,  even  with  highly  incomplete  problem  specifications.  For  example,  we  have  applied 
the  generic  decision  model  to  justify  rules  for  critiquing  decision  trees  constructed  by  human 
analysts  using  a  general  decision-making  tool  [13].  Because  the  constraints  are  justified  by 
a  generic  model,  the  critiquer  needs  only  a  qualitative  understanding  of  the  domain-specific 
actions  and  events  included  in  the  model. 

The  fact  that  we  can  constrain  the  form  of  optimal  decision  policies  at  a  high  level  of 
abstraction  is  a  feature  unique  to  qualitative  decision  models.  Of  course,  the  literature  is 
full  of  generic  decision-theoretic  results  derived  from  abstract  mathematical  models.  The 
contribution  here  is  a  qualitative  decision-modeling  formalism  that  permits  such  results  to 
be  derived  automatically  via  a  uniform  inference  procedure. 
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4  GRAPHICAL  QPN  OPERATIONS 


4  Graphical  QPN  Operations 

The  key  idea  behind  Shachter’s  algorithm  for  influence  diagram  evaluation  [8]  is  that  of  truth- 
preserving  graphical  manipulations.  For  numeric  influence  diagrams,  truth-preserving  means 
that  the  joint  distribution  represented  by  the  revised  model  is  implied  by  the  structures  in  the 
original  model.  In  the  qualitative  context,  truth-preserving  means  that  qualitative  relations 
in  the  original  network  entail  qualitative  relations  on  the  joint  distribution  of  variables  in 
the  modified  network.  In  other  words,  any  joint  probability  distribution  consistent  with  the 
original  QPN  remains  consistent  after  performing  the  truth-preserving  operation. 

Inference  in  QPNs — as  in  influence  diagrams — is  accomplished  by  applying  sequences  of 
two  truth-preserving  operations:  node  reduction  and  link  reversal.  A  chance  node  is  eligible 
for  reduction  iff  it  has  at  most  one  direct  successor.  Decision  and  value  nodes  can  be  reduced 
simply  by  splicing  them  out  and  deleting  any  dangling  links.  A  link  between  chance  nodes 
is  eligible  for  reversal  as  long  as  there  is  no  other  directed  path  between  the  two  nodes 
(otherwise,  the  reversal  would  introduce  a  cycle).  We  define  these  operations  by  specifying 
the  qualitative  relations  holding  in  the  modified  QPN  as  a  function  of  those  in  the  original 
model. 

Suppose  node  c  is  eligible  for  reduction  in  G  and  let  G'  be  the  QPN  obtained  by  reducing 
c.  The  nodes  of  G'  are  just  the  nodes  of  G  minus  c.  If  c  has  no  successors  in  G,  then  G' 
maintains  all  qualitative  relations  of  the  original  save  those  incident  on  c.  Otherwise,  c  has 
exactly  one  successor;  call  it  d.  In  this  case,  all  qualitative  relations  except  those  incident 
on  c  or  d  are  retained  in  G'.  Let  60iC  represent  the  direction  of  influence  between  variables 
a  and  c  in  G,  and  similarly  <5{a,6},c  the  synergy  of  a  and  6  on  c.  The  modified  influence  of  a 
variable  a  on  d  in  G'  is  6'ad,  where 

C  =  ^®(*a.c0M-  (2) 

The  operators  ®  and  ®  denote  sign  addition  and  multiplication,  respectively.  Thus,  the 
influence  update  equation  (2)  sanctions  chaining  of  qualitative  influences,  as  long  as  the  sign 
obtained  from  a  traversal  of  the  path  through  c  agrees  with  the  existing  link,  if  any,  from 
a  to  d.  If  the  two  paths  disagree  (or  one  is  “?”),  6'a  d  is  assigned  the  “?”  sign.  If  a  and  d 
were  not  directly  linked  in  G  ( Saj  =  0,  the  identity  element  for  ®),  then  G'  will  include  an 
influence  determined  solely  by  the  path  through  c. 

The  procedure  for  updating  synergy  links  after  reduction  is  similar,  although  there  are 
more  factors  to  consider.  For  every  pair  of  variables  a  and  b  that  have  influences  on  c  or  d 
in  (7,  the  modified  synergies  are  computed  as  follows: 

—  *^{a,6}.d  ©  (^{a./’J.c  ®  ^ c,d )  ©  b,c  ®  ^{o.cj.rf)  ©  (^a.c  ®  ) •  ('!) 

Figure  3  illustrates  the  situation  before  the  update.  In  typical  applications  of  reduction,  not 
all  of  the  links  shown  are  actually  present  in  G.  In  such  cases,  some  of  the  terms  in  the 
above  equation  simply  drop  out. 
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Figure  3.  A  fragment  of  QPN  G  before  performing  a  QPN  transformation.  The  relations  on  d 
holding  in  G'  after  reducing  c  from  the  network  are  given  by  the  update  equations  (2)  and  (3). 
Reversing  the  link  from  c  to  d  results  in  G",  described  by  equations  (4-6). 


The  second  operation  on  QPNs  is  link  reversal.  Suppose  we  wish  to  reverse  the  influence 
link  from  c  to  d  in  G,  obtaining  the  modified  QPN  G".  G"  retains  all  the  variables  of  G,  as 
well  as  all  of  the  qualitative  relations  not  incident  on  c  or  d.  Using  Bayes’s  theorem,  it  is 
possible  to  show  that  the  qualitative  relation  of  d  on  c  in  the  reversed  network  has  the  same 
sign  as  the  original  link:'1 

C  =  ('1) 

(Since  QPNs  are  acyclic,  G"  cannot  contain  a  link  from  c  to  d.)  For  any  variable  a  that  had 
a  link  to  c  or  d  in  G,  the  new  qualitative  influences  are  as  follows. 

C  =  *«.c  ©  {6a,d  ®  ?) 

K,d  =  (5) 

We  also  must  modify  any  synergy  involving  a  on  these  variables. 

^{a,6),ct  —  ^{a,b},d  ©  (^{a,6},c  ©  & c,d )  ©  (^a,c  ®  ©  {^b.c  ®  (b) 

Synergies  on  c  in  G"  are  assigned  “?”  because  more  specific  signs  are  not  determinable  from 
the  pre- reversal  qualitative  relations. 


5  Query  Processing 

The  purpose  of  inference  in  QPNs  is  to  determine  qualitative  relations  entailed  by  those 
specified  in  the  original  network.  By  applying  sequences  of  the  truth-preserving  operations 

4 Derivation  for  this  as  well  as  all  other  update  rules  are  presented  elsewhere  [12]. 
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above,  we  can  t  ransform  t.lic  original  QPN  to  one  where  the  relations  of  interest  are  direct . 
For  example,  repeated  reductions  and  reversals  transform  the  generic  QPN  of  Figure  I  to 
that  of  Figure  2,  where  the  qualitative  relations  of  the  action  and  information  on  value  are 
explicit. 

A  QPN  query  specifies  j,  the  target  node ,  and  a  set  of  conditioning  variables ,  I\ .  In  the 
example  just  mentioned,  j  is  the  value  node  and  K  =  {ad,  info}.  The  qvrrij- processing  task 
is  to  arrange  the  two  QPN  operations  in  a  sequence  such  that  the  relations  of  I\  on  j  are 
direct  after  executing  the  transformation.  The  qualitative  relation  of  I\  on  j  is  said  to  be 
direct  if  the  following  conditions  are  satisfied: 

1 .  prcd(j)  C  A',  and 

2.  j  $  prcd’(k),  for  all  k  6  K, 

where  pred(n)  is  the  set  of  direct  predecessors  of  node  n,  and  pred'(n)  the  set  of  all  prede¬ 
cessors,  direct  and  indirect  (that  is,  nodes  with  a  directed  path  to  n). 

Under  these  conditions,  the  influences  and  synergies  incident  on  j  directly  encode  the 
QPN’s  constraints  on  the  probability  distribution  for  j  given  K .  The  first  condition  rules  out 
the  presence  of  extraneous  conditioning  variables.  The  second  ensures  that  any  k  G  I\  not 
a  direct  predecessor  of  j  is  conditionally  independent  of  the  target  given  the  rest  of  I\  [12, 
Lemma  4.1].  Jn  the  transformed  network,  Stj  denotes  the  sign  of  the  qualitative  influence 
from  k  to  j  in  context  K  —  {&},  and  6{k',k2),j  the  qualitative  synergy  of  k 1  and  k 2  on  j  in 
context  K  —  {A-1,  k2}. 

To  process  the  query  j  =  v,  I\  =  {act, info)  for  the  generic  QPN,  we  could  reduce  the 
nodes  observe,  costs,  and  benefits  (in  any  order),  then  reverse  the  link  from  0  to  info,  and 
finally  reduce  0.  At  each  step,  the  object  of  the  operation  is  eligible  for  reduction  or  reversal 
in  the  intermediate  network. 

For  any  QPN  query,  in  fact,  there  exists  a  sequence  of  truth-preserving  operations  that 
produces  a  network  in  which  the  relation  of  I\  on  j  is  direct.  This  proposition  follows  from 
Shachter’s  analogous  result  for  numeric  influence  diagrams  [9],  since  the  prerequisites  for 
reductions  (at  most  one  successor)  and  reversals  (must  not  create  a  cycle)  are  identical. 
For  that  matter,  we  can  obtain  a  query-processing  algorithm  for  QPNs  simply  by  adapting  a 
known  inference  procedure  for  numeric  influence  diagrams,  replacing  the  probability  revision 
equations  with  the  QPN  update  rules  of  Section  4. 

However,  the  qualitative  probabilistic  inference  problem  deserves  special  analysis,  as  it 
differs  from  its  numeric  counterpart  in  two  significant  areas.  First,  unlike  numeric  influence 
diagram  evaluation,  QPN  query  evaluation  is  sensitive  to  the  sequence  of  operators  applied  to 
uncover  the  direct  relation.  Although  the  solutions  are  consistent  across  different  sequences, 
the  strength  of  the  results  may  vary.  For  instance,  one  sequence  of  operations  might  produce 
bk)  =  +  while  another  result^  in  ambiguity,  6k }  =  ?. 
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Second,  complexity  considerations  for  the  two  typos  of  model  are  quite  distinct.  In 
contrast  to  manipulation  of  numeric  influence  diagrams,  the  efficiency  of  QPN  inference  is 
insensitive  to  the  cardinality  of  the  outcome  sets  for  each  variable.  (In  fact,  qualitat  ivo  models 
need  not  even  specify  these  outcome  sets.)  To  measure  the  complexity  of  a  QPN  query- 
processing  algorithm,  we  merely  count  the  number  of  sign  additions  and  multiplications 
necessary  to  perform  the  required  network  transformation. 

In  the  following  sections,  I  explore  these  issues  and  their  implications  for  QPN  inference 
algorithms.  Note  that  many  of  the  results  presented  are  independent  of  the  particular  prob¬ 
abilistic  definitions  for  qualitative  influences  and  synergies  specified  above;  they  are  valid  for 
any  qualitative  relations  that  combine  according  to  the  algebra  of  Section  4.  (However,  there 
are  conditions  for  which  these  definitions  are  necessary  for  the  kind  of  qualitative  inference 
we  desire  [12].) 


6  Information  Loss 

A  numeric  influence  diagram  represents  the  unique  joint  probability  distribution  over  the 
variables  in  the  network.  The  reduction  and  reversal  operators  preserve  information,  pro¬ 
ducing  an  influence  diagram  representing  the  unique  projection  of  the  original  distribution 
onto  the  new  variable  set. 

Because  qualitative  relations  enforce  inequalities  on  conditional  probabilities  rather  than 
unique  numeric  values,  a  QPN  represents  a  set  of  probability  distributions  consistent  with 
the  constraints.  QI’N  transformations  are  sound  in  that  all  qualitative  relations  in  the 
modified  network  are  entailed  by  the  original  constraints.  However,  graphical  inference  is 
not  complete ;  there  may  be  distributions  consistent  with  the  modified  network  that  fail  to 
satisfy  the  original  qualitative  relations. 

For  example,  suppose  a  and  c  both  have  positive  links  to  d  and  are  marginally  indepen¬ 
dent.  Reversing  the  link  from  c  to  d  yields  a  QPN  with  a  “?”  link  from  a  to  c,  by  (5).  The 
modified  network  no  longer  captures  the  unconditional  independence  of  a  and  c.  Worse,  if  we 
reverse  the  link  back  using  the  same  update  rule,  the  “?”  remains.  Though  the  two  reversals 
should  ideally  be  inverse  operations,  applying  them  in  sequence  considerably  weakens  the 
network’s  constraint  on  the  joint  probability  distribution. 

Unfortunately,  we  cannot  remedy  this  by  strengthening  the  individual  QPN  update 
rules  (4  6);  each  produces  the  strongest  constraint  expressible  within  the  qualitative  prob¬ 
ability  algebra.  The  problem  is  that  the  set  of  qualitative  signs  {  +  ,—,0,?}  is  not  closed 
under  the  network  transformation  operators.  In  the  example  above,  an  ambiguous  relation 
(“7”  the  vacuous  constraint)  is  introduced  because  no  other  qualitative  conclusion  is  valid. 
The  act  ual  class  of  distributions  consistent  with  the  original  qualitative  relations  is  not  char¬ 
acterized  by  any  of  the  qualitative  signs  available.  (Distributions  allowed  by  “?"  but  not 
consistent,  with  the  original  constraints  are  analogous  to  the  spurious  solutions  produced  by 
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qualitative  reasoning  mechanisms  in  general  [4].)  Not,  surprisingly,  a  similar  problem  arises 
in  oilier  representations  admitting  partial  information  about  probability  distributions.  For 
instance,  Fertig  and  Breese  found  that  the  class  of  lower  probability  functions  is  not,  closed 
under  influence  diagram  transformations  [3]. 

The  following  result  states  that  the  potential  for  information  loss  in  reversal  is  in  fact 
realized  whenever  non-degenerate  predecessors  exist — in  other  words,  whenever  there  is  in¬ 
formation  to  lose.5 

Theorem  6.1  Information  is  lost  in  reversing  the  link  from  c  to  d  unless  all  predecessors 
of  c  or  d  have  “?”  influences  to  both  nodes  and  “?”  synergies  with  any  other  predecessors. 

The  following  result  establishes  that  information  can  be  lost  in  reduction  as  well. 

Theorem  6.2  Information  is  lost  in  reducing  c  from  the  network  iff  there  are  nodes  a  and 
d  such  that: 

•  At  least  one  of  a  and  d  is  non-binary ,  and 

•  <*>«,,<  €  {  +  ,-},  and 

•  ba.c  C5  bC  ({  =  0 
where  0  is  sign  negation. 

When  the  direct  link  between  a  and  d  is  of  opposite  sign  from  the  path  through  c,  5?(a.d)  is 
the  result  even  though  not  all  distributions  Fd\a  are  consistent  with  the  original  constraints. 

7  Minimizing  Ambiguity 

7.1  Order  Dependence 

Given  the  prospect  of  information  loss,  processing  queries  by  graphical  manipulation  may 
not  lead  to  the  strongest  possible  conclusions  about  the  qualitative  relations  among  variables 
of  interest.  Moreover,  different  strategies  (sequences  of  operations)  for  processing  a  given 
query  may  difTer  in  information  lost  and  consequently  yield  solutions  of  varying  strength. 

Consider  the  QPN  of  Figure  4.  Two  simple  transformations  for  revealing  the  direct 
relation  of  I\  —  { Ar 1 ,  /c 2 }  on  j  are: 

1.  reverse  y  — ►  j\  reduce  y\  reverse  j  —+  kl,  and 

2.  reverse  y  kx\  reduce  y. 

r,Proofs  of  these  results  are  provided  in  Appendix  A. 
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I  Jot.  1 1  procedures  yield  the  relation  S~(k\j).  The  first,  however,  is  ambiguous  about  the 
relation  between  k2  and  j,  whereas  the  second  yields  the  more  precise  influence  ,S’+.  Note 
that  both  are  valid;  the  transformation  rules  are  sound  probabilistic  inferences.  However,  the 
conclusions  returned  by  the  first  procedure  are  weakened  by  the  information  loss  incurred 
upon  reversal  of  the  link  from  y  to  j. 


Figure  4.  The  choice  of  links  to  reverse  before  reducing  y  determines  the  strength  of  the  final 
result.. 

C liven  a  QPN  and  a  query,  we  are  clearly  interested  in  finding  the  transformation  sequence 
that  produces  the  strongest  conclusions.  If  computational  efficiency  were  not  an  issue,  we 
could  perform  all  possible  transformations,  returning  the  strongest  qualitative  relations  found 
among  all  variables  of  interest.  Such  an  approach  straightforwardly  implemented,  however, 
would  require  time  exponential  in  the  size  of  the  network.  Since  each  individual  query- 
processing  transformation  can  be  applied  in  polynomial  time,  an  algorithm  based  on  a  single 
operator  sequence  would  be  preferred.  Ideally,  we  would  like  such  an  algorithm  to  apply  the 
sequence  of  operators  minimizing  ambiguity  in  the  end  result. 

The  problem,  then,  is  to  find  a  method  to  compute  this  ambiguity-minimizing  sequence, 
if  it  exists,  given  a  QPN  and  a  query.  Although  it  would  be  convenient  to  derive  this 
transformation  from  structure  alone,  it  is  not  possible  to  do  so,  as  the  transformation  yielding 
the  strongest  results  may  depend  on  the  signs  of  the  qualitative  relations  in  the  network. 
(“Structure”  distinguishes  only  between  zero  and  nonzero  influences.) 

The  network  of  Figure  5  demonstrates  the  sign-dependence  of  ambiguity  minimization. 
Processing  a  query  with  I\  =  {A’1, . . . ,  A'4}  requires  the  reduction  of  y  from  the  network  along 
with  some  reversals.  If  b  =  +  ,  the  transformation  sequence 

reverse  j  — >  y\  reverse  y  -a  A'3;  reverse  y  — >  A'4;  reduce  y 

yields  the  network  with  S+{k',j),  i  =  1,...,4.  If  <5  =  ?,  the  same  sequence  results  in  “?” 
influences  from  each  k'  to  j.  The  ambiguity  is  spurious  for  A-1  and  A-2,  however,  as  revealed 
by  the  transformation 
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reverse  y  — ►  k3\  reduce  y;  reverse  j  — ►  A:3;  reverse  j  — *  k4. 

'I'lie  latter  sequence,  on  the  other  hand,  returns  S!(k3,j)  when  6  =  +  .  In  fact,  no  sequence 
of  reductions  and  reversals  applied  to  this  network  yields  the  strongest  results  in  both  cases. 


Figure  5.  The  optimal  transformation  sequence  depends  on  whether  6  =  ?. 

The  sign-dependence  of  ambiguity  minimization  is  troubling,  because  in  general  deter¬ 
mining  the  sign  of  a  particular  relationship  could  require  query  processing  for  an  arbitrary 
subnetwork.  Nevertheless,  at  present  it  is  an  open  question  whether  an  efficient  algorithm 
exists  for  minimizing  ambiguity  taking  the  signs  of  qualitative  relations  into  account.  This  is 
true  even  for  the  special  case  where  all  influences  are  “?”  links.  Smith  considers  the  problem 
of  maintaining  information  about  conditional  independence  under  graphical  transformations, 
concluding  that  the  problem  of  minimizing  spurious  ambiguity  remains  open  [10]. 

The  existence  of  an  ambiguity-minimizing  sequence  is  itself  uncertain.  There  may  be 
cases  (I  know  of  none  but  have  no  proof  of  their  impossibility)  where  no  single  sequence  of 
transformations  produces  results  as  strong  as  the  union  of  all  transformations.  And  even 
if  an  ambiguity-minimizing  sequence  exists  in  this  sense,  it  remains  possible  that  stronger 
results  could  be  achieved  using  other  transformation  operators  or  perhaps  an  entirely  different 
inference  method. 

7.2  Transformation  Invariants 

Despite  these  unanswered  questions,  there  is  still  much  we  can  say  about  the  inferential 
properties  of  various  query-processing  strategies.  The  following  propositions  characterize 
the  legality  of  certain  patterns  in  sequences  of  operations  and  establish  invariants  among 
classes  of  sequences. 

Proposition  7.1  Reducing  a  node  does  not  negate  the  eligibility  of  any  other  node  for  re¬ 
duction  nor  any  link  for  reversal ,  but  may  establish  such  eligibility.  Reversing  an  arc  from 
c  to  d  negates  the  eligibility  of  both  nodes'  predecessors  for  reduction,  possibly  negates  d's 
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eligibility,  and  possibly  renders  c  eligible.  All  links  to  c  become  ineligible  for  reversal,  some 
links  to  d  may  become  eligible,  and  some  other  links  may  become  ineligible. 

Proposition  7.2  If  two  reductions,  or  a  reduction  and  a  reversal,  can  legally  be  applied  in 
either  order,  the  resulting  networks  are  identical.  This  is  not  true  for  two  reversals  unless 
the  nodes  involved  are  disjoint. 

A  corollary  of  this  result  is  that  node  removals  (reversal  of  all  outgoing  links  followed  by 
reduction)  are  order- in  variant  if  the  nodes  are  not  directly  connected  and  share  no  successors. 
The  next  proposition  characterizes  a  less  restrictive  form  of  invariance  holding  for  reductions. 

Proposition  7.3  Any  sequence  consisting  exclusively  of  reductions  is  equivalent  to  any  other 
legal  sequence  of  the  same  reductions. 

Proposition  7.4  Reducing  a  node  c  is  equivalent  to  reversing  its  outgoing  link  (if  any),  then 
reducing  it. 

This  last  proposition  is  not  surprising  considering  the  correspondence  between  update 
rules  for  the  two  operations  (compare  (2)  and  (3)  with  (5)  and  (6)).  It  implies  that  we  can 
simplify  matters  by  only  considering  reduction  of  nodes  with  no  successors.  Since  reducing 
these  barren  nodes  [8]  entails  no  loss  of  information  (by  Theorem  6.2)  and  does  not  restrict 
further  operations  (Proposition  7.1),  these  may  as  well  be  performed  as  early  as  possible. 
Propositions  7.5  and  7.6  generalize  this  conclusion. 

Proposition  7.5  To  process  a  query  ( j ,  K),  any  node  y  without  a  directed  path  to  j  or  some 
k  G  /\  can  be  summarily  spliced  from  the  network  without  updating  the  remaining  links. 

Proposition  7.6  Any  node  may  be  reduced  as  soon  as  it  is  eligible  without  increasing  the 
ambiguity  of  the  transformation. 

Treating  the  reductions  as  automatic,  the  transformation  construction  task  becomes  es¬ 
sentially  a  matter  of  selecting  the  best  sequence  of  reversals.  The  remaining  proposition 
determines  this  best  sequence  for  a  special  case. 

Proposition  7.7  If  a  node  y  G  pred(j)  is  to  be  reduced  after  reversing  all  but  one  outgoing 
link,  the  link  not  reversed  should  be  y  — >  j . 

The  value  of  these  propositions  is  that  they  provide  constraint  on  the  set  of  minimally 
ambiguous  transformations.  One  can  use  them  to  generate  a  single  admissible  transfor¬ 
mation,  or  within  a  search  procedure  to  prune  the  space  of  operator  sequences  that  need 
be  considered.  For  example,  Proposition  7.6  dictates  that  the  query-processing  algorithm 
should  greedily  reduce  nodes  not  in  I\  U  {j},  whereas  Proposition  7.3  establishes  that  a 
search  algorithm  may  neglect  transformations  differing  only  in  reduction  order  from  one  to 
be  performed. 
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7.3  Upper  Bounds 

Because  the  coverage  of  these  propositions  is  incomplete,  graphical  inference  still  requires 
heuristic  choice  of  operators  and/or  a  combinatorial  search.  Fortunately,  we  can  often  limit 
the  search  by  computing  bounds  on  the  optimal  query-processing  strategy.  The  following 
results  characterize  some  situations  where  isolated  features  of  the  network  can  dictate  the 
best  results  achievable  by  any  sequence  of  transformation  operations. 

Definition  3  A  free  path  from  x  to  y  with  respect  to  K  is  a  sequence  N  of  nodes  indexed 
by  ( 1 , . . . ,  m)  such  that 

•  1  <  z  <  m, 

•  i^i'  =►  N(i)  ±  N(i'), 

•  7^  0,  for  all  i  €  {2,. . .  ,z}, 

•  f>N(i),N(i+ 1)  ^  0.  for  all  i  €  -  1},  and 

•  N(  1)  =  2,  N(m)  —  y,  N(i)  £  I<  for  1  <  i  <  m. 

Node  N(z)  is  called  the  pivot  of  N . 

In  other  words,  x  and  y  are  connected  by  a  series  of  distinct  nonzero  links  with  no  pair 
pointing  into  the  same  node  and  no  links  traversing  nodes  in  K.  Note  that  the  node  indexed 
c  could  be  x  or  y,  in  which  case  there  is  a  directed  path  between  the  two  nodes. 

Proposition  7.8  Consider  a  network  G  with  a  free  path  N  from  j  to  k,  k  6  A'.  Any 
transformation  of  G  processing  the  query  ( j,K )  will  result  in  S8(k,j),  where 

&  =  (  ®  <W),N(t-l)  ®  &  £/V(.),/V(.+l)  )  0<5g, 

\2<t<z  z  <  i  <  m  —  1  J 

and  ha  depends  on  the.  rest  of  the  network. 

Proposition  7.8  establishes  an  upper  bound  for  queries  based  on  a  straight  forward  compu¬ 
tation.  Let  bN  be  the  sign  product  of  the  influences  along  a  free  path  N.  By  the  proposition, 
the  exact  result  of  the  query  is  6N  ®  SG  for  some  6G.  By  the  properties  of  0,  this  value  can 
be  no  “stronger”  than  6/v  in  the  ordering  of  qualitative  signs  (zero  provides  the  strongest 
constraint;  “?”  the  weakest).  Let  paths(x,y,  K)  be  the  set  of  all  free  paths  from  x  to  y  with 
respect  to  K.  Then  an  upper  bound  for  the  upon  processing  the  query  (j,  A  )  can  be 
found  by  computing 

Nepaths[j.k,K) 
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In  particular,  finding  a  free  path  N  with  6^  =  ?  (indeed,  finding  a  free  path  with  any  “?" 
link)  indicates  that  “?”  is  an  upper  bound  for  the  relation  of  interest.  The  same  is  entailed 
by  the  existence  of  two  paths  N\  and  N2  such  that  <3/v,  =  +  and  b^.2  =  — .  In  these  cases, 
the  bound  is  exact,  as  ambiguity  is  the  weakest  qualitative  relation.  An  upper  bound  of  +  , 
in  contrast,  entails  only  that  the  result  is  in  the  set  {  +  ,?}. 

The  following  propositions  describe  two  general  situations  where  ambiguity  is  inevitable, 
even  if  the  original  network  contains  no  “?”  links. 

Proposition  7.9  Suppose  the  network  contains  free  paths  N\  from  j  to  k'  and  N2  from  k 
to  k',  k,  k'  6  K ,  with  pivots  indexed  by  z\  and  z 2,  such  that 

1.  (disjoint  prefix)  A^j ( z i )  7^  N2(i2)  for  all  i\  <  Cj  and  i2  <  z2,  and 

2.  (convergence)  both  N\  and  N2  end  with  links  into  k'  (that  is,  A'^^i)  7^  k1  ^  N2(z2)). 

Then  any  transformation  of  this  network  processing  the  query  (j,I\)  will  result  in  S?(k,j). 

Proposition  7.10  Suppose  the  network  contains  two  free  paths  from  j  to  k,  k  E  I\,  denoted 
N\  and  N2  with  corresponding  pivots  indexed  by  Z\  and  z2>  such  that  ATi(ri)  £  Ar2  and 
N2(z2)  (£  N\ .  Then  any  transformation  of  this  network  processing  the  query  (j,  K)  will  result 
i"  S!(k,j). 

7.4  More  Open  Questions 

The  bounds  given  by  the  propositions  above  are  not  generally  sharp,  and  do  not  cover  all  sit¬ 
uations.  Further  transformation  principles  are  necessary  to  select  which  reversal  to  perform 
in  situations  where  several  are  available.  Experience  with  various  algorithms  suggests  that 
upstream  reversals  are  generally  preferred  (since  information  loss  tends  to  get  propagated 
upstream  anyway);  however,  we  currently  lack  precise  results  to  this  effect. 

It  remains  to  be  seen  whether  optimal  algorithms  can  be  identified  for  special  cases  of 
QPNs  and  queries.  For  example,  queries  with  singleton  K  (that  is.  the  relation  of  k  on 
j  given  nothing  else)  may  be  more  amenable  to  analysis  than  the  general  problem.  One 
property  commonly  exploited  in  numeric  probabilistic  networks,  single-connectivity,  does 
not  appear  to  offer  much  advantage.  Note  that  the  networks  of  Figures  4  and  5  are  both 
singly  connected. 

8  Algorithmic  Complexity 

As  mentioned  above,  any  query  can  be  processed  in  time  polynomial  in  the  size  of  the 
network.  This  result  follows  from  the  analysis  of  Rege  and  Agogino's  “topological”  solution 
of  numerical  influence  diagrams  [6],  with  an  extra  quadratic  factor  for  updating  synergies. 
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In  fact,  any  non-redundant  transformation  sequence  (one  without  multiple  reversals  of  the 
same  link)  can  be  applied  in  polynomial  time. 

This  conclusion  highlights  the  disparity  in  computational  properties  betw<vn  qualitative 
and  numeric  influence  diagrams.  For  the  latter,  complexity  is  highly  sensitive  to  the  size  of 
each  variable’s  value  set  (see,  for  example,  the  analysis  of  Ezawa  [2]),  a  factor  irrelevant  in 
the  qualitative  realm.  For  some  intractable  numeric  diagrams  (those  solvable  by  reductions 
alone  including  the  3SAT  construction  used  by  Cooper  [l]  to  demonstrate  the  NP-hardness 
of  numeric  query  processing),  the  efficient  QPN  transformation  is  guaranteed  to  be  minimally 
ambiguous. 

The  straightforward  comparison  is  misleading,  however,  because  graphical  inference  in 
QPNs  is  incomplete.  Although  determining  qualitative  relations  is  often  easier  than  deter¬ 
mining  precise  numeric  relations,  this  is  not  necessarily  true  in  the  general  case.  A  more 
definitive  statement  awaits  further  results  on  minimally  ambiguous  transformations,  the  com¬ 
plexity  of  transformation  search,  and  the  abstract  qualitative  probabilistic  inference  problem. 


9  Summary 

QPNs  provide  a  representation  for  the  qualitative  relationships  among  variables  without 
requiring  excessively  precise  assessments.  Like  numeric  influence  diagrams,  QPNs  support 
inference  via  graphical  manipulations.  Because  qualitative  models  incompletely  specify  the 
joint  distribution  among  variables,  information  is  not  always  preserved  by  these  manipu¬ 
lations.  In  designing  a  query-processing  algorithm,  minimizing  qualitative  ambiguity  is  of 
paramount  concern. 

I  have  established  several  properties  of  QPN  transformations  that  enhance  the  strengt  h  of 
derived  relations  or  limit  the  search  necessary  to  ensure  minimal  ambiguity.  Unfortunately, 
there  is  no  procedure  for  deriving  a  minimally  ambiguous  transformation  sequence  based 
solely  on  the  topology  of  a  QPN. 

As  indicated  above,  several  open  questions  remain  regarding  the  nature  and  even  exis¬ 
tence  of  optimal  query-processing  algorithms.  In  addition  to  these,  several  issues  in  qual¬ 
itative  probabilistic  reasoning  are  worthy  of  further  investigation.  For  example,  inference 
methods  based  on  macro-  or  entirely  new  transformation  operators  may  offer  advantages 
over  sequences  of  reductions  and  reversals.  Inference  approaches  not  based  on  graphical 
transformations  should  also  be  explored.  Finally,  extensions  of  the  qualitative  representat  ion 
itself  ([1  1,  Section  7.5])  will  necessitate  continued  work  on  inferential  aspects  of  qualitative 
probabilistic  networks. 
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A  Proofs 

The  notat  ion  for  probability  distributions  follows  [12]. 

Theorem  6.1  Information  is  lost  in  reversing  the  link  from  c  to  d  unless  all  predecessors  of 
c  or  d  have  “?”  influences  to  both  nodes  and  synergies  with  any  other  predecessors. 

PROOF  First  we  must  establish  that  no  information  is  lost  when  the  stated  condition  is 
met.  If  all  predecessors  have  “?”  influences  and  synergies  on  both  c  and  d ,  the  effect  of  two 
reversals  is  to  leave  the  network  unchanged.  Thus,  the  QPN  after  two  reversals  admits  the 
same  set  of  joint  distributions.  Since  reversal  is  a  sound  inference  operator  (all  distributions 
consistent  with  the  model  prior  to  reversal  are  also  consistent  post-reversal),  each  individual 
reversal  must  also  have  preserved  the  set  of  consistent  distributions. 

To  demonstrate  information  loss  when  the  conditions  are  not  met,  I  construct  spurious 
distributions  corresponding  to  each  distinct  case.  In  doing  so,  I  simplify  matters  by  taking 
c  and  d  to  be  binary.  This  generalizes  directly  to  multi-valued  variables  by  constructing 
distributions  over  the  larger  domains  with  the  probability  mass  allocated  according  to  some 
binary  criterion. 


case  1:  6a  c  /  ?.  Suppose  that  8ac  —  SCt<i  =  +  (the  other  sub-cases  are  analogous).  Thus, 
in  the  original  network,  ax  >  a2  implies 

Pr(C|ajx)  >  Pr(C|a2x),  (7) 

for  all  contexts  x  (instantiations  of  other  predecessors  of  e).  The  reversal  introduces  d  as  a 
predecessor  of  c. 

Pr(C|a,x)  =  Pr(C|a,Dx)  Pr(D|a,x)  +  Pr(C|cq/)x)  Pr(Z)|a,x).  (8) 

Let  a,  =  Pr(/-)|a,x).  Consider  a  distribution  with 


Pr(C]a,Dx)  = 
Pr(C|a,£>x)  < 

Pr(C\a2Dx)  > 
Pr(C|a2/;x)  = 


P2, 

«2. 


/  02(1-01), 

max(at ,  — ; - ),  and 


1  -  o2 


02(1  -  Qi) 
1  —  o2 


Such  a  distribution  satisfies  the  both  of  the  post-reversal  constraints  (S'JC  =  -f  and  8" c  =  ?), 
as  !}r(( ’|«/}.r)  >  Pr(C|oDx)  for  either  value  of  a.  However,  applying  these  inequalities  to 
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ilu*  expansion  of  Pr(f'|«,-x )  (8),  we  find  t.lia.1. 

Pr(C|«ix)  <  a2at\  +  <*2(1  —  ftj),  while 
Pr(C|a2x)  >  a,a2  +  — 7" - —  (l-«2). 

After  simplifying,  the  two  inequalities  directly  contradict  the  original  constraint  based  on 
(ia  r  =  +  (7).  Thus,  the  QPN  after  reversal  admits  a  strictly  greater  class  of  distributions 
than  the  original. 


case  2:  6at<i  ?.  As  a  representative  sub-case,  suppose  6Q)£f  =  Scj  =  -f.  These  original 
constraints  dictate  that  Pv{D\a\Cx)  >  Pr(£)|a2Cx)  for  ax  >  a2.  Using  Bayes’s  theorem, 


Pr(D|ajCx) 


Pr(C|a,£)x)  Pr(D|a,x) 

Pr(C|a,Dx)  Pr(Z}ja,x)  +  Pr(C)a,Z)x)  Pr(D|a,x) 


Suppose  that 


Pr(qa,Dx)  Pr(C|a2Dx) 
Pr(C|aiDx)  “  Pr(C|a2£>x) 


This  is  consistent  with  the  constraints  after  reversal,  as  long  as  a  >  1  (to  sat’sfy  S'JC  =  +  ). 
Then 

Pr(fl|  a.Cz)  =  n,  -al;r(Pkl>  (9) 

o  Pr(/?|a,x)  +  1  —  Pr(D|a,x) 


The  link  from  a  to  d  allows  the  possibility  that  Pr(Z)|a]x)  <  Pr(D|a2x),  which,  together 
with  (9),  would  imply  that 

Pr(  .D^Cx)  <  Pr(Z)|a2Cx), 


violating  the  original  constraint. 


case  3:  <${a,6},c  ^  ?.  Assume  that  both  a  and  b  have  “?”  influences  on  both  c  and  d ;  the 
other  possibilities  are  covered  by  the  cases  above.  Further,  let  us  take  <5{0,(,},c  =  6cj  —  + 
(similar  arguments  will  work  for  the  other  combinations).  The  positive  synergy  implies 

Pr(C|G!6ix)  —  Pr(C|a26ix)  >  Pr(C|ai62x)  —  Pr(C|a262x),  (10) 

for  «|  >  a2  and  6j  >  b2.  After  reversal,  d  becomes  a  predecessor  of  c  and  there  is  no 
synergy  constraint  among  these  variables.  Consider  a  joint  distribution  for  {o,6,  c,  d)  where 
Pr(«i ,bi,C,F))  —  2/17  and  Pr (a,b,c,d)  =  1/17  for  all  other  values  of  the  variables.  Such  a 
distribution  is  consistent  with  the  post-reversal  constraints  (6'JC  =  +)  but  fails  to  satisfy  (10). 


ft 


ft 
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case  4:  b{„,b),d  7^  We  need  only  cover  cases  where  a  and  b  have  iiilluences  on  both 
r  and  d,  and  “?"  synergy  on  c.  Suppose  as  a  representative  sub-case  that  =  Km  =  +■ 

I'hus,  we  must  satisfy 

PrfDla^iCx)  —  Pr(/)|a2&icx)  >  PT(D\alb2cx)  —  Pr(£)|a262rx). 

However,  the  distribution  of  case  3  also  violates  this  constraint,  though  it  is  permitted  by 
the  post-reversal  network. 

case  5:  b{a,c}M  7^  ?•  For  ^{a,c},<i  =  Km  =  +,  the  distribution 

Pr(a,,  C,  D)  =  2/9, 

Pr(a,c,  d)  =  1/9,  all  other  values 

is  a  spurious  consequence  of  reversal  just  as  in  the  previous  two  cases. 

Since  these  cases  exhaust  the  condition  of  the  theorem,  reversal  in  all  but  degenerate 
situations  necessarily  admits  spurious  probability  distributions.  [  | 

Theorem  6.2  Information  is  lost  in  reducing  c  from  the  network  iff  there  are  nodes  a  and 
d  such  that: 

•  At  least  one  of  a  and  d  is  non-binary ,  and 

•  Km  €  {  +  ,-  },  and 

®  britc  ^ '  b(~M  0  bfi'd, 
where  O  is  sign  negation. 

Phoof 

(if):  I  demonstrate  information  loss  for  a  case  with  a  non-binary,  6a<c  —  Km  —  +»  and 
brj  =  — .  The  other  combinations  satisfying  the  conditions  can  be  argued  analogously.  After 
reducing  c,  the  QPN  provides  no  constraint  on  the  probabilistic  relation  between  a  and  d, 

as  Km  =  ?.  '  |  . ’ 

When  c  and  d  are  binary,  the  probability  distribution  for  d  given  a  can  be  written  as 
Pr(D|a,x)  =  Pr(D|a,Cx)  Pr(C|a,x)  +  Px(D\atC  x)  Pr(C|a,x), 
or,  with  symbols  substituted  for  corresponding  terms: 

Pr(/)|a,x)  =  a,/?,  +  7,(1  -  /?,).  (11) 
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Consider  an  ascending  sequence  of  values  for  a:  at  <  a2  <  a3  <  a4.  1  show  that  the 

distribution 

Pr(Z?|a,x)  =  2/3, 1/3, 2/3, 1/3  for  i  =  1,2, 3, 4  (12) 

is  inconsistent  with  the  original  constraints,  though  it  is  allowed  by  the  reduced  network. 
Since  o,  <  7,  (8r,{  =  — ),  it  follows  from  (11)  that 

o,  <  l>r(D|aI.r)  <  7,.  (13) 


rl’h<’refore  71  >  2/3.  And  since  both  a,  and  7,  are  nondecreasing  in  1  (by  8a %<2  =  +),  72  is 
also  at  least  2/3.  Given  that  a,  is  nonnegative,  we  can  use  (11)  to  bound  ft  from  below: 


ft  >  1 


Pr(£)|a,x) 

Hi 


Thus,  ft  >  1  —  (l/3)/(2/3)  =  1/2.  The  inequality  holds  for  ft  as  well,  as  8a,c  =  +•  We  use 
this  lower  bound  in  turn  to  derive  a  bound  on  a.  Rearranging  (11),  we  get 

7,  —  Pr(T)|a,x) 


Noting  that  2/3  <  73  <  1  and  ft  >  1/2,  we  can  use  this  to  deduce  that  03  >  1/2.  As 
nr,  is  nondecreasing,  a4  is  also  bounded  from  below  by  1/2.  But  this  contradicts  the  first 
inequality  of  (13),  as  Pr(Dja4x)  =  1/3.  Therefore  the  distribution  of  (12)  violates  the  original 
constraints,  and  information  has  been  lost  in  reducing  c. 


(only  if):  I  show  that  each  of  the  conditions  is  necessary  for  information  loss  by  establish¬ 
ing  that  violating  any  one  of  them  guarantees  that  all  distributions  consistent  post-reduction 
are  also  consistent  pre-reduction. 

condition  1:  one  of  a  and  d  non-binary.  As  a  representative  case,  take  c  =  6a j  —  + 
and  8C'j  —  — .  For  simplicity  assume  c  is  binary,  though  it  would  do  just  as  well  to  consider 
it  multi-valued  with  all  intermediate  values  having  zero  probability.  After  reducing  c,  all 
probability  distributions  for  a  and  d  are  consistent  with  the  network.  Since  both  variables 
are  binary,  the  post-reduction  distribution  is  fully  specified  by  Pr(D|Ax)  and  Pr(D|>4x). 
These  probabilities  are  related  to  the  original  constraints  as  follows: 

Pr(r|/lx)  =  Pv{D\ACx)  Pr(C|Ax)  +  Pr(D\AC x)  Pr(C'|/4x)  (Id) 

Vr(D\Ax)  =  Px{D\ACx)  Pr(C|Ai)  +  Pi{D\ACx)  Pr((7|Ar).  (15) 

The  question  is  whether  all  values  for  the  LHS  probabilities  can  be  obtained  from  some 
assignment  to  the  RHS  terms  that  is  consistent  with  the  specified  qualitative  influences. 


* 


4 
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Consider  the  assignments 


* 


t 


?r(0\ACx)  =  0 
Pr(D|46\r)  =  Pv{C\Ax)  =  1. 

These  extreme  values  guarantee  that  any  probabilities  assigned  to  the  other  terms  will  satisfy 
the  inequalities  entailed  by  <5„iC,  6aj,  and  6Cid-  In  addition,  let  Pr(C|j4:r)  =  0.  Then  the 
equations  (14)  and  (15)  reduce  to: 

Pr(D\Ax)  =  Pr{D\ACx) 

Pr(D|4x)  =  Pr(£>|^Cx). 

Since  the  RHS  terms  are  unconstrained  by  the  other  assignments,  the  qualitative  influences, 
or  each  other,  any  distribution  for  d  given  a  and  x  alone  is  consistent  with  the  original 
constraints. 


condition  2:  6ad  £{  +  ,—}.  If  dad  =  0,  the  distributions  consistent  with  the  network  after 
reducing  c  are  those  satisfying  the  constraint  imposed  by  SatC  <g>  6c<d.  Suppose  that  <$a  c  =  +. 
Then  the  qualitative  influence  of  a  on  d  after  reduction  is  simply  Scj.  One  cumulative 
distribution  satisfying  the  constraint  6a<c  =  -f-  is6 


Fc(co|«x) 


1  if  cq  >  a 
0  otherwise. 


(IK) 


Since  a  and  d  are  conditionally  independent  given  c  ( 6a ^  =  0),  the  distribution  for  d  given  a 
after  reduction  is: 

Fd{d0\ax  )  =  J  Fd(do\c0x)dFc{c<)\  ax). 

Substituting  in  (16),  we  get 

Fd(d0\ax )  =  Fd{d0\cx)\c=a, 

where  the  distribution  for  d  given  c  is  evaluated  at  c  —  a.  Thus,  the  distributions  allowed 
after  reduction  for  a  and  d  are  equivalent  to  those  allowed  by  6cj,  which  is  the  qualitative 
influence  obtained  by  reduction. 

The  other  violation  of  condition  2  occurs  when  Sa<d  =  ?.  In  this  case  the  distribution 
post-reduction  is  unconstrained.  Indeed,  no  constraint  is  warranted,  as  any  distribution  is 
achievable.  Regardless  of  6cj,  the  possibility  that  c  and  d  are  independent  is  consistent,  in 
which  case  the  relation  between  a  and  d  is  constrained  only  by  da  d,  which  is  to  say  not  at 
all. 

6 For  ( 16)  to  he  well-defined,  variables  a  and  c  must  have  identical  domains.  A  more  complicated  argument 
is  required  when  the  domains  differ. 
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condition  3:  6aiC  ®  6cd  =  Q  6a  d.  Suppose  condition  3  were  violated  while  condition  2 
were  not.  Then  6a  d  agrees  with  6a<c  ®  SCid  and  therefore  persists  as  the  qualitative  relation 
between  a  and  d  after  reducing  c.  As  above,  since  conditional  independence  of  c  and  d 
is  always  consistent,  the  distribution  post  reduction  can  be  no  more  constrained  than  its 
pre-reduction  counterpart.  Thus,  no  information  is  lost. 

□ 

Proposition  7.1  Reducing  a  node  does  not  negate  the  eligibility  of  any  other  node  for  re¬ 
duction  nor  any  link  for  reversal,  but  may  establish  such  eligibility.  Reversing  an  arc  from 
c  to  d  negates  the  eligibility  of  both  nodes’  predecessors  for  reduction,  possibly  negates  d’s 
eligibility,  and  possibly  renders  c  eligible.  All  links  to  c  become  ineligible  for  reversal,  some 
links  to  d  may  become  eligible,  and  some  other  links  may  become  ineligible. 

Proof 

reduction/reduction:  Recall  that  a  node  is  eligible  for  reduction  iff  it  has  at  most  one 
successor.  After  reducing  node  c,  c’s  predecessors  lose  c  as  a  successor  but  inherit  c’s  successor 
if  there  is  one.  Since  every  node’s  successor  list  either  stays  the  same  or  decreases  by  one, 
previously  eligible  nodes  remain  so.  If  c  has  no  successors,  any  predecessor  linked  only  to  c 
becomes  eligible  for  reduction. 

reduction/reversal:  A  link  is  eligible  for  reversal  if  doing  so  would  not  create  a  cycle. 
In  other  words,  one  can  reverse  the  link  from  c  to  d  if  there  is  no  indirect  directed  path 
connecting  the  two.  Since  reducing  a  node  creates  no  new  paths,  the  operation  cannot  spoil 
reversibility  for  any  links.  (Of  course,  links  incident  on  the  reduced  node  are  eliminated  by 
the  operation.)  If  the  only  path  denying  the  reversibility  of  c  —*  d  happens  to  be  c  — ►  e  — ♦  d, 
however,  reducing  e  would  render  c  — +  d  eligible  for  reversal. 

reversal/reduction:  Upon  reversing  the  link  from  c  to  d,  each  node  gains  the  other’s  pre¬ 
decessors.  Each  of  these  predecessors  thus  has  at  least  two  successors  and  hence  is  ineligible 
for  reduction.  Since  d  gains  c  as  a  successor,  it  becomes  ineligible  if  it  previously  had  exactly 
one.  Similarly,  c  loses  d  as  a  successor,  so  it  becomes  eligible  if  previously  it  had  exactly  two. 
Eligibility  for  other  nodes  is  unaffected. 

reversal/reversal:  After  the  reversal,  any  predecessor  of  c  has  an  indirect  path  to  c  via 
d,  thus  ruling  out  reversal  of  its  link  to  c.  Pre  reversal  links  to  d  are  rendered  eligible  by 
th  e  reversal  just  in  case  the  only  indirect  path  connecting  its  edges  was  through  c.  Links 
connecting  other  nodes  may  become  ineligible,  for  example  if  one  of  the  new  predecessor 
links  completes  a  path  through  c.  No  other  links  may  become  eligible  for  reversal,  as  any 
indirect  paths  canceled  by  the  reversal  are  reinstated  by  inheritance  of  predecessors. 

□ 
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Proposition  7.2  If  two  reductions,  or  a  reduction  and  a  reversal,  can  legally  be  applied  m 
either  order,  the  resulting  networks  are  identical.  This  is  not  true  for  two  reversals  unless 
the  nodes  involved  are  disjoint. 

Proof 

two  reductions:  Suppose  we  are  interested  in  the  qualitative  relation  between  a  and  d 
after  reducing  c1  and  c2.  Since  the  QPN  is  acyclic,  assume  without  loss  of  generality  that 
d  (£  predm(a)  and  c2  ^  pred’(c1).  With  these  constraints,  the  only  configurations  of  interest 
have  the  nodes  to  be  reduced  between  a  and  d — otherwise  the  reductions  will  not  affect  the 
relation  of  interest.  Thus,  the  relevant  network  fragment  has  the  general  form  of  Figure  6. 
Some  of  the  links  shown  may  be  optional  (that  is,  8  =  0).  In  fact,  at  least  one  of  612  and 
8\d  must  be  zero  in  order  to  satisfy  the  requirement  that  c1  be  legally  reducible. 


Figure  6.  General  QPN  fragment  for  analyzing  the  order-invariance  of  reducing  c1  and  c2. 


Suppose  we  reduce  c1,  then  c2.  Let  8'  denote  the  qualitative  influence  of  y  on  z  after 
reducing  c'.  Applying  the  update  rule  for  reduction  (2), 

4  =  ^<*4  ©  (^a.l  ®  h\ 4 ) 

^a,2  =  ^a,2  ©  (<^a,l  ®  ,2 ) 

2  4- 


8\4  =  8. 


Reducing  c2  yields 


C  =  C©(^,  2®*W)- 


After  substituting  the  expressions  for  81  above, 

h, lj  =  8atd  ®  (6a,i  ®  8\td)  ©  ([^0,2  ®  (<5a,l  ®  ^1.2)]  ©  £>74) 

—  8a  d  ©  (<5a>  1  ®  8\  d)  ®  (8a< 2  ®  824)  ©  (<5a,l  ®  ,2  ®  ^24)- 


(17! 
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If  instead  r2  were  reduced  first,  the  intermediate  results  would  be 

*1,1  -  K,,t  <!>(/>„, 2  C0/>2,,/) 

^,1  =  #«.i 

=  ©  (^1,2  ®  ^2,d)- 

Subsequently  reducing  c1  yields 

=  ®  (^a,2  ®  ^2,i)  ®  (£a,l  ®  $l,d)  ®  (^a,l  ®  <$1,2  ®  ^2.^), 

which  is  equivalent  to  the  expression  for  6*'d  above  (17). 

To  complete  the  argument  we  need  to  establish  that  the  invariance  holds  for  synergies  as 
well.  Consider  another  node  b  with  the  same  outgoing  links  as  a  in  Figure  6.  By  an  exercise 
similar  to  that  above  (though  more  complicated7),  we  can  derive  the  general  expression  for 
the  qualitative  synergy  of  a  and  b  on  d  after  reducing  c1  followed  by  c2: 

^\a,b},d  ~  6{a,b},d  ®  (<5{a,6},l  ®  ^1  ,d)  ®  (<>6,1  ®  <5{a,l},d)  ®  (<^a,l  ®  <^{fc,l},rf) 

®  (<^{a,6},2  ®  <^2 ,d)  ®  (<^{a,fc},l  ®  <*>1,2  ®  ^2,d)  ®  (6fc,l  ®  ^{a,l},2  ®  <*> 2,d ) 

®  (^a,l  ®  <5{f>,l},2  ®  <^2 ,d) 

®  (bb,2  ®  ^{a,2},d)  ®  (<^6,2  ®  ^o,l  ®  ^{2.1  >,ti )  ®  (<^6,1  ®  ^t.2  ®  <^{a,2}.d) 

®  (^6,1  ®  ^1,2  ®  ^a,l  ®  <5{2,l},d)  (18) 

®  (ba,2  ®  ^{6,2},d)  ®  (ba,2  ®  &  ’  ^*{ 2.!  ],d)  ®  (<5a,l  ®  <$1 ,2  ®  <5{6,2},rf) 

P  (-o,l  ®  ^1,2  ®  ^6,1  ®  <^{2.1}, d)-  (19) 

Note  that  lines  (18)  and  (19)  are  zero  and  thus  drop  out,  since  at  least  one  of  <5i,2  and  6\ti 
(and  therefore  <${2,1  },d)  must  be  zero. 

It  turns  out  that  performing  the  calculation  with  c2  reduced  first  results  in  the  same 
synergy  expression.  Thus,  d  =  6^^  d  and  reduction  is  pairwise  order- in  variant. 

reduction  and  reversal:  Consider  the  operations  to  reduce  y  and  to  reverse  c  —>  d.  In 
order  for  the  operations  to  be  applicable  in  either  order,  it  must  be  the  case  that: 

•  y  c  and  y  ^  d  (else  the  link  would  not  exist  post-reduction)  and 

•  y  $  pred(c)  and  y  £  pred(d)  (see  Proposition  7.1). 

Civen  these  constraints,  the  updates  for  the  two  operations  affect  a  disjoint  set  of  qualitative 
relations.  Hence,  the  results  are  identical  when  performed  in  either  order. 

7The  calculations  required  to  verify  these  propositions  were  substantially  aided  by  a  program  for  manip¬ 
ulating  QPNs  with  qualitative  relations  specified  by  symbolic  expressions. 
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two  reversals:  Reversals  are  not  order-invariant  in  general.  In  fact,  even  the  topology  of 
the  resulting  network  depends  on  the  order  reversals  are  performed.  For  example,  suppose 
the  only  non-zero  links  in  a  network  are  c1  — ►  d  and  c2  — >  d.  The  orientation  of  the  link 
connecting  the  c's  after  the  two  reversals  depends  on  which  is  performed  first.  (The  sign  on 
the  link  is  “?”  in  either  case.) 

If  the  two  arcs  reversed  do  not  share  a  node,  the  two  operations  affect  a  disjoint  set  of 
relations.  In  this  case,  the  results  do  not  depend  on  the  order  of  the  reversals. 

□ 

Proposition  7.3  Any  sequence  consisting  exclusively  of  reductions  is  equivalent  to  any  other 
legal  sequence  of  the  same  reductions. 

PROOF  Let  o  be  a  sequence  of  nodes  such  that  it  is  legal  to  reduce  any  node  in  the  sequence 
after  all  of  its  predecessors  have  been  reduced.  For  convenience,  index  the  nodes  by  their 
order  in  o,  so  we  can  write  a  =  (l,...,m).  Consider  o',  another  legal  sequence  of  node 
reductions  containing  the  same  nodes  as  o,  though  possibly  in  a  different  order.  Without 
loss  of  generality,  suppose  that  the  first  node  of  a  (node  I)  is  not  first  in  er'  (if  it  is,  we  need 
only  compare  the  tails  of  the  sequences  anyway).  Thus,  o'  =  (s,i,l,/),  where  s  and  t  are 
subsequences  of  nodes. 

Now  consider  a"  =  ( s ,  l,i,t),  the  sequence  obtained  from  o'  by  swapping  nodes  i  and  1 
in  the  order.  First  we  must  consider  the  the  legality  of  o"  as  a  sequence  of  reductions. 

1.  The  subsequence  s  is  obviously  legal,  as  it  constitutes  the  first  part  of  the  legal  sequence 
ah 

2.  The  legality  of  o  implies  that  it  is  legal  to  reduce  node  1  first ,  that  is,  in  the  starting 
network.  Therefore  it  must  be  legal  to  reduce  node  1  after  performing  the  sequence 
s  of  reductions,  as  reduction  alone  cannot  eliminate  a  node’s  eligibility  for  reduction 
(Proposition  7.1). 

3.  Similarly,  if  it  is  legal  to  reduce  node  i  after  s  (as  in  o'),  it  must  also  be  legal  to  reduce 
it  after  (,s,  I ). 

Thus,  we  have  established  that  it  is  legal  to  reduce  nodes  (s,  l,i)  in  that  order.  Proposi¬ 
tion  7.2  dictates  that,  for  any  given  network  in  which  both  orders  are  legal,  reductions  ( ? ,  1 ) 
and  (l,i)  have  the  same  result.  Therefore,  since  the  prefixes  are  identical,  reduction  se¬ 
quences  (s,?,  1)  and  (s,  l,i)  leave  the  network  in  the  same  state.  Appending  t  then  obviously 
preserves  the  identity,  and  we  have  established  that  o'  and  o"  are  equivalent. 

By  repeatedly  swapping  node  1  with  its  neighbor  to  the  left,  we  can  eventually  produce  a 
sequence  in  which  it  appears  first.  Because  each  step  preserves  both  legality  and  equivalence, 
the  resulting  sequence  yields  results  identical  to  o'.  Having  done  this,  we  can  then  proceed 
to  move  node  2  to  the  left  in  the  same  fashion,  until  it  is  second  in  the  list.  Repeating  this 
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fur  all  nodes  in  tin*  series,  we  achieve  in  tin*  end  t,lie  sequence  a,  thus  demonstrating  that  it 
is  equivalent  to  a'.  Since  a  and  o'  were  unrestricted,  we  have  established  the  equivalence  of 
all  legal  reduction  sequences  containing  the  same  set  of  nodes. 


Proposition  7.4  Reducing  a  node  c  is  equivalent  to  reversing  its  outgoing  link  (if  any),  then 
reducing  it. 

PROOF  Let  d  be  the  sole  successor  of  c  and  let  a  and  b  be  generic  predecessors  of  c  and/or 
d.  (If  there  is  no  d  the  equivalence  is  trivial.)  Compare  the  states  of  the  network  obtained 
by  reducing  c  and  reversing  c  — ►  d.  The  correspondence  of  (2)  and  (3)  with  (5)  and  (6) 
establish  that  8'ad  =  8"  d  and  ^{a,b},d  -  ^{a,b},d-  In  contrast  to  the  reduction,  however,  the 
reversal  may  also  result  in  other  updates  to  influences  and  synergies  on  c.  But  since  these 
updates  are  rendered  moot  by  subsequent  reduction  of  the  now-barren  c,  the  ending  states 
are  identical.  □ 

Proposition  7.5  To  process  a  query  ( j ,  K),  any  node  y  without  a  directed  path  to  j  or  some 
k  €  K  can  be  summarily  spliced  from  the  network  without  updating  the  remaining  links. 

PROOF  By  Theorem  6.2,  no  information  is  lost  in  reducing  barren  nodes.  Furthermore,  the 
network  after  reduction  does  not  depend  on  the  links,  if  any,  into  the  barren  node.  We  can 
establish  the  proposition  by  demonstrating  that  all  nodes  satisfying  the  stated  conditions 
can  be  removed  by  a  sequence  consisting  solely  of  barren  node  reductions. 

Note  that  any  barren  node  not  in  {/}  U  K  trivially  satisfies  the  conditions  for  y  in  the 
proposition.  Reducing  all  of  these  nodes  has  the  same  effect  as  summarily  splicing  them  from 
the  network.  After  these  reductions,  some  other  nodes  may  have  become  barren.  Suppose 
we  continue  to  reduce  all  barren  nodes  outside  of  {j}  U  K  until  no  more  exist.  At  that  point, 
all  nodes  meeting  the  conditions  for  y  have  been  removed.  To  see  this,  observe  that  any 
y  must  either  be  barren  or  have  a  barren  successor  (since  the  network  is  acyclic).  As  the 
process  ends  only  when  there  are  no  barren  nodes  not  in  {/}  U  I\,  y  and  all  other  nodes 
satisfying  the  proposition  must  have  been  removed.  □ 

Proposition  7.6  Any  node  may  be  reduced  as  soon  as  it  is  eligible  without  increasing  the 
ambiguity  of  the  transformation. 


PROOF  Let  c  be  the  node  eligible  for  reduction,  and  d  its  predecessor.  First,  note  that 
the  only  loss  of  information  attributable  to  reduction  is  in  case  there  is  some  a  linked  to 
both  c  and  d  such  that  the  path  from  a  to  d  via  c  and  the  direct  link  are  of  opposite 
signs  (Theorem  6.2).  In  particular,  no  dependence  information  is  lost.  This,  combined  with 
the  fact  that  reductions  cannot  nullify  the  eligibility  of  subsequent  operations,  implies  that 
reducing  c  immediately  is  an  optimal  policy  with  respect  to  the  information  encoded  in  the 
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topology  of  the  network.  Thus,  wo  need  consider  only  the  potential  introduction  of  spurious 
ambiguity  due  to  premature  reduction. 

Next,  let  us  examine  the  propagation  of  sign  information  in  transformation  operations. 
Upon  reduction  of  c,  the  new  qualitative  influence  is  given  by 

*  S'  =  8a,d  ©  (6„fC  ®  Sc,j)-  (20) 

This  information  can  be  propagated  to  other  relations  via  operations  on  adjacent  nodes. 
Specifically,  there  are  four  ways  8'  can  have  an  impact  on  the  signs  of  other  relations. 

1.  Reduction  of  a,  d ,  or  a  node  that  is  a  direct  successor  of  a  and  a  predecessor  of  d. 

2.  Reversal  of  a  — »  d. 

3.  Reversal  of  a  link  x  — ►  d. 

4.  Reversal  of  a  link  d  — »  x. 

In  the  case  of  reduction,  8'  appears  as  one  of  the  terms  in  a  recursive  invocation  of  (20),  with 
the  node  names  rebound.  By  the  reasoning  of  Proposition  7.8,  information  lost  in  8'  can 
never  be  recovered  by  iteration  of  this  formula.  In  a  reversal  of  a  — ►  d,  all  links  into  a  are 
updated  by  the  same  formula,  and  the  same  argument  applies.  Links  into  d  are  converted 
to  “?”  regardless  of  8 ',  so  the  information  loss  had  no  adverse  effect.  The  same  is  true  of 
influences  affected  by  x  — >  d  reversals:  the  a  — »  d  link  is  updated  by  simple  chaining  and 
a  — ♦  x  is  indiscriminately  made  ambiguous.  Upon  reversing  d  —+  x,  a  —+  d  is  either  “?”  or 
unchanged,  while  a  —>  x  is  updated  as  if  d  were  reduced. 

The  updates  above  follow  a  general  pattern.  The  actual  sign  of  8'  matters  only  for 
chaining  relations:  links  arranged  in  a  head-to-tail  adjacency.  Therefore,  we  have  established 
that  the  information  lost  in  reduction  does  not  propagate  in  any  but  the  inevitable  directions. 
Given  that  there  is  no  loss  of  dependence  information,  therefore,  nothing  can  be  gained  by 
postponing  reduction.  Q 

Proposition  7.7  If  a  node  y  £  pred(j)  is  to  be  reduced  after  reversing  all  but  one  outgoing 
link,  the  link  not  reversed  should  be  y  — ►  j . 

0  PROOF  Suppose  y  has  outgoing  links  to  each  of  x  and  j ,  and  let  b  be  an  arbitrary  node 

with  a  free  path  to  y.  By  Proposition  7.5,  we  can  assume  without  loss  of  generality  that  x 
has  a  directed  path  to  j  or  some  k'  £  K .  If  the  path  is  to  j ,  then  y  — *  j  is  not  reversible  and 
the  conclusion  of  the  proposition  follows  by  necessity.  (If  j  has  a  directed  path  to  x,  then 
c  — >  x  is  not  reversible  and  the  choice  presumed  by  the  Proposition  docs  not  apply.)  Thus, 
x  has  a  directed  path  to  k' . 

Note  that  both  b  and  j  have  free  paths  to  k'  via  y.  At  this  point,  Proposition  7.9  does  not 
apply,  as  the  two  paths  violate  its  disjoint  prefix  condition  ( y  is  the  pivot  of  the  path  from 
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j).  Upon  reversal  of  y  — >  j,  however,  the  pivot  is  eliminated  and  this  condition  is  satisfied. 
The  implication  of  the  reversal,  then,  is  that  any  further  query  processing  must  result  in  a 
“?”  link  between  b  and  j.  This  ambiguity  is  not  inevitable  for  a  reversal  of  x  —*  j\  hence, 
this  operation  is  preferred.  [_] 

Proposition  7.8  Consider  a  network  G  with  a  free  path  N  from  j  to  k,  k  E  K .  Any 
transformation  of  G  processing  the  query  ( j,I\  )  will  result  in  S6(k,j),  where 


6  = 


\2 <i<z 


2  < l<  m  —  1 


®  SG  i 


and  6g  depends  on  the  rest  of  the  network. 


(21) 


PROOF  We  establish  the  result  by  examining  the  effect  of  reductions  and  reversals  on  the 
composition  of  the  free  path.  The  proof  proceeds  by  induction  on  the  number  of  operations 
required  for  the  transformation.  Take  as  the  base  case  the  set  of  transformations  implemented 
by  zero  steps.  These  correspond  to  the  degenerate  free  path  where  j  is  connected  directly  to 
k  and  the  query  is  answerable  by  inspection  of  the  original  network.  In  this  situation,  the 
multiplicative  chain  (21)  reduces  to  a  single  term  Sjtk,  and  SG  is  zero.  Now  hypothesize  that 
all  transformations  up  to  a  specified  size  yield  results  of  the  form  of  (21).  We  show  that  any 
additional  operations  preserve  that  form,  as  all  expressions  arising  off  the  main  path  can  be 
additively  factored  out  into  the  SG  term. 

Let  us  adopt  the  notation  N(i  ±  1)  to  denote  the  successor  of  N(i)  along  path  N,  whose 
index  depends  on  whether  i  is  less  than  or  greater  than  the  pivot,  z.  Reduction  of  a  node 
w  &  N  can  affect  a  link  from  N(i)  to  N(i  ±  1)  iff  N(i)  is  a  predecessor  of  w  and  N{i  ±  1)  is 
its  sole  successor.  In  this  case,  the  sign  on  the  link  becomes 

^v(.),/v(«± i)  =  ^v(.),N(;± i)  ©  <S>  <5u,,yv(t±i))-  (22) 

By  the  inductive  hypothesis,  the  result  of  the  rest  of  the  transformation  has  the  form  of  (21), 
which  we  can  write  as 

(^/v(i),/v(i± i)  ®  bn)  ©  (23) 

taking  Sr  to  denote  the  product  of  links  from  the  rest  of  the  chain.  We  can  plug  in  the 
right-hand  side  of  (22)  and  multiply  through,  yielding 

[SN{i),N(i±\)  ®  Sr)  ©  [(<$AT(,)iU;  (g>  5u„Ar(i±l)  <S>  Sr)  ©  (5c], 

which  satisfies  the  required  form,  with  the  expression  in  brackets  substituted  for  the  previous 
SG. 


4 


J 
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Next,  consider  the  reduction  of  a  node  N(h),  h  >  z.  We  can  establish  that  the  new 
sequence  N'  —  (/V(l N(h  —  1),  N(h  +  1 N(m))  satisfies  the  conditions  for  a  free 
path.  First,  note  that  all  links  are  unchanged  except  for8 

=  f’N(h-l),N(h+l)  ©  ®  f>N(h),N(h  +  \ )  )•  (24) 

Therefore,  the  free  path  is  still  fully  connected  by  nonzero  links,  and  the  segment  from  N(z) 
to  N(rn)  remains  directed.  Again  by  the  inductive  hypothesis,  the  query  result  takes  the 
form  of  (23),  with  i  =  h  —  1.  Substituting  the  expression  from  (24)  yields 

Multiplying  through  results  in 

{f>N(h-l),N(h)  ®  f>N(h),N(h+l)  ®  $ R )  ©  [(<5jV(/>-l),/V(A+l)  ®  ^/t)  ©  fe], 
again  satisfying  the  required  form. 

The  same  conclusion  holds  by  symmetry  for  reduction  of  N(h),  h  <  z.  The  pivot  N(z) 
cannot  be  reduced  because  it  has  more  than  one  successor. 

By  a  similar  argument,  reversing  a  link  not  on  the  path  preserves  both  the  existence 
of  the  path  and  the  required  form,  since  the  update  formula  for  either  type  of  predecessor 
influence  (5)  includes  the  original  sign  as  an  additive  term.  Suppose,  then,  that  the  link 
from  /V(/i)  to  N(h  +  l)  is  reversed,  for  h  >  z.  We  can  construct  N'  exactly  as  for  the  case  of 
reduction.  The  reversal  introduces  a  new  link  from  N(h  —  1)  to  N(h  +  1),  with  sign  computed 
as  for  the  corresponding  reduction  (24).  By  the  argument  above,  the  form  of  the  result  is 
preserved  for  the  new  path  N'.  A  symmetric  argument  is  valid  for  reversing  the  link  from 
N(h)  to  N(h  —  1),  h  <  z. 

The  argument  needs  to  be  modified  for  reversing  a  link  from  the  pivot  itself.  In  this 
operation  no  links  on  the  path  are  affected  except  the  one  being  reversed,  and  this  one 
maintains  the  same  sign.  After  the  reversal,  however,  the  designation  of  pivot  2  must  be 
transferred  to  the  other  end  of  the  reversed  link  in  order  to  maintain  the  condition  for  a  free 
path.  Since  this  path  retains  the  same  nodes  and  the  same  <5s,  the  inductive  step  follows 
directly. 

Because  each  operation  preserves  the  target  form  (21)  given  that  it  holds  for  the  rest  of 
the  transformation,  the  entire  transformation  must  maintain  this  result.  |  | 

Proposition  7.9  Suppose  the  network  contains  free  paths  N\  from  j  to  k'  and  jV2  frotn  k 
to  k',  k ,  k '  £  A',  with  pivots  indexed  by  z\  and  z2,  such  that 

/.  (disjoint  prefix)  Ni(ii)  ^  A ^(*2)  for  a H  l\  ^  z\  and  12  ^  ^2.  and 

'’Because  the  indices  are  renumbered,  node  N'(i)  corresponds  to  , Y(i  +  1)  in  the  original  sequence,  for 
1  >  h. 
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2.  (convergence)  both  Nx  and.  N 2  end  with  links  into  k'  (that  is,  yV,(;,)  ^  k1  ^  ^2(^2))- 
Then  any  transformation  of  this  network  processing  the  query  ( j,K )  will  result  in  S'f  k.  j ) . 

PROOF'’  The  proof  is  by  induction  on  the  lengths  of  Nx  and  N  In  the  base  case,  |.V,|  = 

| iV2 1  =  1:  each  path  consists  of  a  single  direct  link  into  k' .  To  \  ocess  the  query,  the  path  4 

from  j  to  k1  needs  to  be  removed.  Since  k'  cannot  be  reduced,  this  entails  a  reversal  of  the 

j  — >  k'  link.  If  A-  is  a  predecessor  of  k '  at  the  time  of  the  reversal,  the  operation  introduces  a 

link  of  sign  “?”  from  k  to  j.  This  holds  regardless  of  the  signs  of  the  links  into  k1  (as  long  as 

they  are  nonzero,  a  definitional  property  of  free  paths),  by  the  reversal  update  rule  (5).  The 

only  way  k  would  not  be  a  predecessor  of  k'  at  this  time  would  be  if  the  k  — ►  k'  link  were 

reversed  first.  In  this  case,  a  “?”  link  from  j  to  k  is  necessarily  introduced.  In  either  event, 

there  exists  a  free  path  of  sign  6^  =  ?  from  j  to  k,  and  therefore  Proposition  7.8  implies 

that  the  result  of  the  query  must  be  S?(k,j). 

For  the  inductive  hypothesis,  assume  that  the  result  is  true  for  all  paths  with  jjVJ  <  nx 
and  1 7V2 1  <  n2.  Given  a  chain  with  |  A^i  j  =  nt  -fi 1  or  | N2\  =  n2  +  1 ,  we  show  that  any  operation 
must  have  one  of  the  following  effects: 

1.  to  preserve  the  existence  of  paths  Nx  and  N2,  of  the  same  length,  satisfying  the  condi¬ 
tions  of  the  proposition, 

2.  to  preserve  the  existence  of  such  paths  with  shorter  length,  or 

T  to  introduce  a  free  path  from  k  to  j  with  at  least  one  “?”  link. 

In  the  first  case,  the  situation  is  identical  from  the  standpoint  of  our  proposition  and  the 
process  can  be  repeated.  In  the  second,  we  can  invoke  the  inductive  hypothesis  that  the 
conclusion  holds  for  paths  of  the  shorter  length.  In  the  final  case,  we  can  invoke  Proposi¬ 
tion  7.8  directly,  as  in  the  base  case,  to  establish  the  final  result.  Since  the  first  case  cannot 
be  maintained  through  the  entire  sequence  of  transformations — the  query  cannot  be  fully 
processed  in  that  state- -eventually  one  of  the  other  two  must  prevail,  thus  establishing  the 
inductive  argument. 

first  consider  a  node  reduction.  As  in  the  argument  for  Proposition  7.8  above,  this 
operation  preserves  the  existence  of  free  paths  whether  or  not  the  node  was  previously  on 
one  of  the  paths.  Similarly,  it  cannot  alter  the  disjointness  of  the  path  prefixes  or  the  \ 

convergence  of  their  final  links  into  k'.  If  the  node  to  be  reduced  is  on  one  of  the  paths,  this 
path  is  thereby  shortened,  thus  fulfilling  the  condition  for  case  two  above. 

Reversing  a  link  not  on  one  of  the  paths  cannot  negate  its  existence.  As  argued  above, 
reversing  a  link  on  a  free  path  also  preserves  its  existence,  possibly  at  a  length  one  node 
shorter.  However,  we  must  address  special  attention  to  the  case  where  the  link  reversed 
is  the  final  one  in  the  chain,  that  into  k'.  Reversing  this  link  does  not  obviously  maintain 
the  convergence  condition  of  our  proposition.  Suppose  we  reverse  the  last  link  of  A'i  (the 
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analysis  for  N2  is  symmetric).  There  are  two  relevant  subcases,  depending  on  whether  the 
immediate  predecessor  of  k'  on  /V)  is  the  pivot  node.  Let  yx  be  the  predecessor  of  k'  on  /V) 
and  y2  its  counterpart  on  N2.  If  yx  ^  Nx(zx),  then  the  reversal  introduces  a  new  link  from 
y j’s  predecessor  into  k'  and  a  new  path  N[  is  formed  with  one  node  fewer  than  N\.  A  new 
path  N2  is  also  formed  if  yx  =  y2.  Otherwise,  if  yx  =  Nx(zx ),  the  reversal  introduces  a  link  of 
sign  “?”  from  y2  to  yx.  (By  the  disjointness  condition,  the  two  nodes  cannot  be  identical  if 
one  is  a  pivot.)  With  this  link,  y2  has  a  directed  path  to  k.  Since  the  segment  of  N2  from  j 
to  7/2 is  a  free  path,  so  is  the  extended  path  from  j  to  k  via  y2.  As  noted  above,  the  existence 
of  a  free  path  with  one  “?”  link  is  sufficient  to  establish  that  the  result  of  query  processing 
is  S'!(k,j). 

Because  these  cases  exhaust  the  possibilities,  conditions  are  maintained  or  progressed, 
and  the  inductive  argument  is  concluded.  Q 

Proposition  7.10  Suppose  the  network  contains  two  free  paths  from  j  to  k,  k  6  K,  denoted 
Ari  and  N2  with  corresponding  pivots  indexed  by  zx  and  z2t  such  that  Nx(zx)  N2  and 
A^(  *2 )  ^  A/i .  Then  any  transformation  of  this  network  processing  the  query  (j,  K)  will  result 
771  S7(^,j). 


PltOOF  By  the  arguments  above,  the  existence  of  the  free  paths  is  preserved  by  all  opera¬ 
tions,  and  the  only  operation  that  can  afTect  the  conditions  on  the  pivot  node  are  reversals  of 
links  emanating  from  that  node.  Consider  a  reversal  of  the  link  from  Nx{zx)  to  yx  —  Nx(zx  +  \) 
(the  other  cases  are  symmetric).  If  yx  &  N2,  then  the  conditions  of  the  proposition  are  pre¬ 
served  with  a  change  in  pivot  to  yx.  Suppose  yx  is  on  the  path  N2.  Since  yx  is  also  on  Nx,  we 
know  it  cannot  be  the  pivot  of  N2  and  therefore  it  must  have  one  predecessor,  x2,  on  that 
path.  However,  the  reversal  of  Ar1(zi)  — ►  yx  creates  (or  updates)  a  link  from  x2  to  Nx(zx) 
with  a  “?”  sign.  Since  x2  has  a  directed  path  along  N2  to  either  j  or  k ,  and  Nx(zx)  has 
directed  paths  to  both,  there  must  be  a  free  path  from  j  to  k  with  pivot  x2  that  includes 
this  new  link.  By  Proposition  7.8,  this  is  sufficient  to  entail  ambiguity  of  the  query.  [j 
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